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Abstract. We give a combinatorial interpretation of a classical meta-Fibonacci sequence 
defined by G{n) = n — G(G(n — 1)) with the initial condition G(l) = 1, which appears 
in Hofstadter's "Godel, Escher, Bach: An Eternal Golden Braid". The interpretation 
is in terms of an infinite labelled tree. We then show a couple of corollaries about the 
behaviour of the sequence G(n) directly from the interpretation. 



1. Introduction 

In his famous book Godel, Escher, Bach: An Eternal Golden Braid, Douglas Hofstadter 
introduced his G-sequence @] p. 137] defined as 

G(n) =n — G(G(n - 1)) ; G(l) = 1. (1.1) 

This recursion is part of the general family of recursions given by G(n) = n — G(G k (n — 1)) 
with initial condition G(l) = 1. The superscript of k > 1 means a A;-fold composition 
of the function G(n). Recursions of this form, where the argument of the defining terms 
depend on previous values of the recursive function, are called meta-Fibonacci or nested 
recursions. There is knowledge about Hofstadter's G-sequence in literature nowadays, but 
little is known about the other fc-fold recursions above. 

Let F n denote the Fibonacci numbers, defined by F n = F n _i + F n _2 and Fi = Fi = 1. 
Meek and Van Rees [5] showed that if n = F ri + • • • + F r . is the Zeckendorf representation^ 
of n then G(n) = F ri -± + • • • + Soon both Granville and Rasson [3], and Downey 

and Griswold [2] showed that G(n) = \_(n + 1)(/> _1 J where 4> = is the golden ratio. 

Our result is an interpretation for G(n) in terms of counting labels in an infinite labelled 
tree. After our discovery we learned that the result was known by some in meta-Fibonacci 
circles, but to the best of our knowledge there is no published proof of this combinatorial 
interpretation in literature. The purpose of this paper is not only to give a proof of the 
interpretation, but also to provide motivation for adapting this approach in order to find 
a combinatorial interpretation of the related fc-fold recursion above. 
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2. Combinatorial interpretation of the recursion G(n) 

Throughout this section we will refer to Hofstadter's G-sequence as defined in (jl.lj) 
as G(n). Table [27T1 contains the first 20 values of G(n). Notice from this table that the 
difference between a term of G(n) from the previous term is always or 1. Such a sequence 
of positive integers is called slow-growing. It is not difficult to show via induction that G(n) 
remains slow-growing for all n. More importantly, let f(n) = # G _1 ({n}) for n > 1, which 
we call the frequency sequence of G(n). The frequency sequence displays many patterns (see 
Table l2~2j) . for example, it consists only of Is and 2s. In fact, let us interpret the frequency 
sequence as an infinite word W = II^Li /( n ) where the product stands for concatenation. 
Setting w\ = 2,W2 = l,n>3 = 2, and w n = w n -\w n -2 for n > 4, we can experimentally 
verify that the initial segments of the word W factorizes as W = wiW2W%w\ ■ ■ ■ w\. Such 
observations about the frequency sequence motivate us to define a labelled tree such that 
for each n, G(n) is the label of the parent of vertex n + 1. 

Table 2.1. First 20 terms of G(n) 
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Table 2.2 


First 20 terms of the frequency sequence f(n 
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This tree, denoted as Q, is defined recursively as follows. Q has a root vertex whose 
left subtree is a copy of Q. The root also has a right child which itself has a copy of Q 
as its only subtree. The definition is pictured in the left-half of Figure 12.11 The labeling 
procedure works by denoting the root of Q as vertex 1, and then labeling the vertices in 
increasing order of their height from the root. All vertices at a specific height are labelled 
in increasing order from right to left. The right half of Figure 12.11 shows Q labeled up to 
height four. 

Lemma 1. The tree Q contains i*X+2 vertices at height h. Consequently, the number of 
vertices in Q from height through h is i*X+4 — 2. 

Proof. From the definition of Q we note that vertices at height h of Q consists of vertices 
at height h — 2 of Q contributed by the right subtree of the root and vertices at height 
h — 1 contributed by the left subtree of the root. As such, the number of vertices at height 
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Figure 2.1. Q is defined on the left. To the right is Q labelled up to height 4. 



h satisfies the Fibonacci recursion with initial conditions F2 and F$ for heights and 1 
respectively. It follows that there are F^+2 vertices at height h and that the number of 
vertices from height to h is F2 + • • • + = Fh+4, ~ 2- We deduce the latter from the 
well-known Fibonacci identity F\ + ■ ■ ■ + F^ = Fh+2 — 1- See, for example, [TJ p. 2] where 
a combinatorial proof is provided. □ 

Remark 1. We could alter the tree Q to make it so that G(n) is the label of the parent of 
vertex n. To do this we need to insert a new vertex below the current root of Q and then 
attach it to the root. We then redo the labeling starting from the new vertex. However, 
there are certain advantages with the current definition. First, we think that it makes 
the recursive definition of Q more pleasant. Further, the definition as it stands will make 
some of the computations in the proof of the combinatorial interpretation easier and the 
exposition more clear. 

Theorem 2.1. Let g{n) denote the label of the parent of vertex n + 1 in the infinite tree 
Q . Then g{n) satisfies the recursion g(n) = n — g{g{n — 1)) with the initial value g(l) = 1. 

Proof. The proof is by induction on the vertex labels n. By comparing the values of G(n) 
from Table [2TT1 with the values of g(n) from Figure [27Lj we see that the theorem holds for 
the first 19 vertices, which consists of all vertices of Q up to height 4. Now suppose that 
vertex n + 1 is located at height h > 4 and that the theorem holds for all vertices in Q 
preceding n + 1. 

As h > 4, vertex n + 1 is either located on the left subtree of vertex 1 or on the sub- 
tree emanating from vertex 2. Both these subtrees are a copy of Q. Denote the subtree 
containing vertex n + 1 as Q' , and the subtree not containing n + 1 as Q" . If we now 
remove all the labels of Q' that it inherits from Q, and relabel Q' in the same manner as 
we labeled Q starting from 1, then vertex n + 1 of Q will receive a new label no + 1 on Q' 
with uq < n since no + 1 is located at a lower height in Q' . (Specifically, at height h — 1 of 
Q 1 if Q' is the left subtree of vertex 1 or at height h — 2 of Q' if it is the subtree of vertex 
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2.) As an example, consider n + 1 = 10 in which case Q' is the left subtree of vertex 1 and 
no + 1 = 5. We establish the recursive formula at index n through its validity at index no 
and by evaluating the differences (n+1)— (no+1), g(n)—g(no), and g(g(n—l))—g(g(no — l)). 

Evaluating (n + 1) — (no + 1): The difference (n + 1) — (no + 1) is the number of vertices 
in Q consisting of vertices 1, 2, and all the vertices preceding n+1 that are in Q" . Such 
vertices preceding n + 1 in Q" consist of all vertices from height to h — 2 of Q" . As Q" is 
isomorphic to Q, there are F^+2 — 2 vertices from height to h — 2 by Lemma [TJ Therefore, 

n + l = (n + l) + 2 + (i^ +2 -2). (2.1) 

Evaluating g(n) — g(no): To this end notice that the parent of vertex n + 1 in Q is the 
same as the parent of the vertex labelled no + 1 in Q', but its label differs between the 
two trees. Vertices 1 or 2 cannot be the parent as n + 1 is above height 4. The difference 
g(n) — g(no) is again given by accounting for vertices 1 and 2 in Q along with the vertices 
in Q preceding the parent of n + 1 and residing in Q" . Since the parent of vertex n + 1 is 
at height h — 1 of Q, the vertices in Q" that precede the parent of n + 1 range from height 
to h — 3 of Q" . Counting as before we deduce that g(n) — g(no) = -F/i+i- Thus, since 
n = no + Fh+2 from (|2.ip , we conclude that 

n - g(n) = n - g(n ) + F h . (2.2) 

Evaluating g(g(nl)) — g(g(riQ — 1)): We begin by showing that except for some instances 
vertex g{g{n — 1)) in Q is the same as the one labeled g(g(no — 1) in the separate labeling of 
G' . In this case we will be able to compute g(g{nl)) — g(g(riQ — 1)) readily. Let us assume 
that vertex n + 1 , considered as a vertex of Q situated in the subtree Q' , is not the first or 
last vertex at its height in Q' . 

Consequently, vertex n lies in Q' and points to the same vertex as the one labelled no in 
the separate labeling of Q' . As such g{n — 1) points to the parent vertex of no in Q' , which 
is labelled g{riQ — 1) in the separate labeling of Q' . This parent cannot be the last vertex 
at its height in Q\ for otherwise, it would be the common parent of vertices no and no + 1 
in Q' located at the height above. This would make no + 1 the last vertex at its height in 
Q\ contradicting the assumption that vertex n + 1 in G, pointing to the same vertex as the 
one separately labeled no + 1 in G', is not so. 

As vertex g(n — 1) of Q is not the last vertex at its height in the subtree Q' , vertex 
g(n — 1) + 1 of Q lies in the subtree Q' and points to the same vertex as the one separately 
labelled g(no + 1) in Q' . Hence the parent of vertex g{n — 1) + 1 in Q, lying at height h — 2 
of G, also lies in Q' and points to vertex g{g{riQ — 1)) in the separate labeling of Q' (recall 
that h > 4 so vertex g(g(no — 1)) is defined in Q'). This confirms that vertex g(g(n — 1)) 
in Q is the same as the vertex labeled g(g(no — 1)) in Q 1 . 

The difference g(g(n — 1)) — g(g(no — 1)) in labels again comes from vertices 1, 2 and 
all vertices in Q preceding g(g(n — 1)) that lie on Q" . Since vertex g(g(n — 1)) is at height 
h — 2 of Q, the vertices preceding g(g(n — 1)) that lie on Q" range from height to h — 4 
of that subtree. There are Fh — 2 of them by Lemma [T] and thus 

g(g(n - 1)) = g(g(n - 1)) + 2 + (F h - 2) = g(g(n - 1)) + F h . 
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By the induction hypothesis uq — g(no) = g(g(no — 1)), and so by (|2.2j) we get 
n - gin) = n - g(n ) + F h = g(g(n - 1)) + F h = g(g(n - 1)) . 

We now deal with the exceptional cases, assuming first that vertex n + 1 in Q is located 
on the left subtree of vertex 1 and at heigh h > 4. Deviating from the previous notation a 
bit, denote the left subtree containing n + 1 as Qi and the subtree emanating from vertex 2 
as Q r . Vertex n + 1 is at height h — 1 of Gi • If it is the first vertex at height h — 1 of Qi then 
vertex n is the last vertex at height h — 2 of Q r . So g{n — 1) is the last vertex at height 
h — 3 of Q r and so g{n — 1) + 1 is the first vertex at height h — 2 of Q\. Vertex g(g(n — 1)) 
is then the first one at height h — 3 of Q\. 

Meanwhile, when Qi is labeled separately as before, vertex n + 1 in Q will again point 
to the same vertex as some vertex labeled no + 1 in Qi. Then the vertex labeled uq in Qi 
becomes the last vertex at height h — 2 of Qi, which implies that g(no — 1) is the last vertex 
at height h — 3 of Qi, and g(no — 1) + 1 is thus the first vertex of Q\ at height h — 2. So 
vertex g(g(riQ — 1)) in Qi, the parent of g(no — 1) + 1, is the first vertex at height h — 3 
of Q\. It points to the same vertex as the one labelled g{g{n — 1)) in Q. The difference 
g(g(n — 1)) — g(g(no — 1)) is thus due to the same reasons as in the previous case. Thus 
from the induction hypothesis for no, we deduce as before that n — g{n) = g{g{n — 1)). 

When vertex n + 1 in Q is the last vertex at height at height h — 1 of Qi, vertex g(n — 1) 
is the common parent of vertices n and n + 1, situated as the last vertex at height h — 2 
of Q\. So g(n — 1) + 1 is the first vertex at height h — 2 of Q r making g{g{n — 1)) the first 
vertex at height h — 3 of Q r . On the other hand, with no + 1 defined as before, vertex 
g{g{n$ — 1)) in Q\ will be the first one at height h — 2 of Q\. Both vertices g(g(n — 1)) and 
9(g( n o — 1)) ar e located at height h — 1 of Q. 

One must be careful about calculating g(g(n— 1))— g(g(no — 1)) because vertex g(g(n—l)) 
of Q occurs before the vertex pointing to g{g{nQ — 1)) in Q. Although the label of the latter 
vertex on Qi, which is g{g{riQ — 1)), is less than g{g{n — 1)). In this situation we simply 
note that since vertex g{g{riQ — 1)) is the first one at height h — 2 of Qi, its label is 
F2 + • • • + Fh—i + 1 = Fh+i — 1. Similarly, as g{g{n — 1)) is the first vertex at height h — 1 
of g, its label is F h+2 - 1. Therefore g(g(n - 1)) - g(g(n - 1)) = F h+2 - F h+1 = F h . As 
before it follows from the induction hypothesis on no that n — g(n) = g{g{n — 1)). 

When vertex n + 1 is located on the subtree g r , the exceptional cases are analogous to 
the previous ones. The case for vertex n + 1 being the last one at height h — 2 of g r is 
analogous to the case when n + 1 is the first vertex at height h — 1 of g\. When vertex 
n + 1 is the first vertex at height h — 2 of g r the situation is analogous to when n + 1 is 
the last vertex at height h — 1 of Q\. This completes our induction and the proof of the 
theorem. □ 

We now present a corollary which is the simplest case of Meek and Van Rees's result on 
how G(n) acts on positive integers when written in their Zeckendorf representation. Recall 
from the introduction that the authors showed if n = F ri + • • • + F r . is the Zeckendorf 
representation of n then G(n) = F ri —i + • • • + F r .—i. We now show this for n = F r , that 
is, for the Fibonacci numbers. With more work one could get the whole result from the 
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combinatorial interpretation but we will not pursue that direction. Instead we will prove 
the factorization property of the word HnLi f( n ) that we observed earlier experimentally, 
and which led to our discovery of the combinatorial interpretation for G(n). 

Corollary 2.2. g{F n ) = F n —i for n > 2, where F n is the n th Fibonacci number. 

Proof. For n > 4, the second vertex at height n — 3 is labeled F^ + • • • + F n _2 + 2 = F n . 
Its parent is the first vertex at height n — 4 with label F n ^\ — 1. Thus the parent of vertex 
F n + 1 is the second vertex at height n — 4 since the first one has vertices F n — 1 and F n 
as its children. Hence g(F n ) = F n _i. For 2 < n < 4, one can verify the claim from Figure 
EU □ 

Corollary 2.3. Define words w\ = 2,W2 = 1,103 = 2 = w\ and w n = w n -2W n -\ for 
n > 3. Let f{n) denote the frequency sequence of g(n). The infinite word W = n^Li f( n ) 
factorizes as W = w\W2 11^3 w n- 

Proof. The combinatorial interpretation makes it clear that g(n) is slow-growing, and The- 
orem 12.11 implies that the frequency sequence of G(n) from (jl.ip is the same as that of 
g(n). We note that f(n) is the number of children of vertex n in Q. Indeed, if vertex 
n has children labeled from a to b then g(k) = n precisely when k ranges from a + 1 to 
6+1. Let Wh denote the word produced by concatenating from left to right the values 
of f(n) as n ranges in increasing order over the vertices at height h of Q. For h > 1, the 
vertices at height h of Q are arranged by placing the vertices at height h — 2 of Q to the 
right of the vertices at height h — 1 of Q, following from the recursive definition of Q. Thus 
Wh = Wh-iWh-x with Wo = 2 and W\ = 12. We show that Wh = Wh+iWh+2 for h > 0. 
We have that W\ = 12 = ^2^3 and W2 = 212 = ^3^4. Assuming that Wh = Wh+iWh+2 for 
1 < h < N, we get that Wn = Wn-2Wn-i = wn-iWnwnwn+i = wn+iWn+2 where the 
last equality uses the recursive definition of w n . The claim follows by induction. Finally, 
to finish the proof we note that Wq = 2 = w\ and so 

00 00 00 00 00 

n f( n ) = n Wh = w yi w h = w\ n w h+ xw h+ 2 = wiw 2 wi. 

n=l h=Q h=l h=l h=3 

□ 

The consequence of Corollary 12.31 is that it allows us to view the word W as two inter- 
twined copies of itself along with the initial seeds w\ and ui2- More precisely, the above 
factorization shows that W = u>iW2 Il^Ls w n = ^3 11^=2 w nW n +i = w s Yl^ =2 w n +2 = 
YY^L^Wn. What we hope is that one can find analogous factorizations of the frequency 
sequences generated by the A;-fold recursions that was discussed in the introduction. In 
this manner one may find corresponding infinite trees for the A:-fold recursions and proceed 
to derive a combinatorial interpretation. 
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